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■ Standard tests for nonlinearity reject the null hypothesis of a Gaussian linear pro- 

cess whenever the data is non-stationary. Thus, they are not appropriate to distin- 
guish non-linearity from non- stationarity. We address the problem of generating 
5"H proper surrogate data corresponding to the null hypothesis of an ARMA process 

with slowly varying coefficients. 

1 Introduction 

Most methods used in the field of linear and nonlinear time series analysis assume 
stationarity of the considered data. Non-stationarity is very likely to lead to wrong 
(T) ' results. This is especially true for tests for nonlinearity. A common approach is to 

04 . split the time series into segments which can be considered nearly stationary and 

perform individual tests. But for short time series or not too slowly varying non- 
stationarities these segments have to be made too short to meaningfully calculate 
a test statistic on them. 
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2 Surrogate Data 

' , To generate surrogate data that has the same time variation of autocorrela- 

tions than the original data .one can in principle follow the general approach 
C"| I of constrained randomization!!! The corresponding cost function would be the 

O ■ discrepancy between the autocorrelations of the original and the surrogate data 

in sliding windows. Autocorrelations should be calculated up to sufficiently high 
lags. The most striking disadvantage of this method is the extreme computational 
burden. Therefore, we use an alternative method: 
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1. split up the time series into segments 

2. generate a surrogate for each segment 

3. join the segments to one new surrogate 

For step || we use an iterative algorithm]!! The size of the segments is typically 
too small to perform individual tests for nonlinearity on them, because most test 
statistics need sufficiently large data sets. A disadvantage of the method is the loss 
of correlations between the segments which can lead to a bias. 



3 Cyclostationary Processes 

The first class of processes we consider are cyclostationary, having periodically 
varying parameters. These processes have been found to yield rejections of the null 
hypothesisci when using a test statistic derived from the correlation sum. Here we 
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Figure 1. Size of nonlinearity tests with varying segment sizes. The modulation period of the 
non— stationarity is fixed to T mo d = 2000. From upper to lower curve the values A =2.0, 1.5, 1.0 
and 0.0 are used. 
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use the maximum likelihood estimator of the correlation dimensionQEfu 

d ml = ^ C(e ) (1) 

jC(e)/e de 
o 

Following TimmeJl, the considered time series is generated by an AR(2)-process 

X n = CllXn-l + a-i.Xn-1 + Vn, Tl = 1, . . . , 2000 (2) 

with coefficients chosen to be 

a x = 2cos(27r/T)exp(-l/r) (3) 
az = — cxp(— 2/r) 

where T is the period and r the relaxation time of a damped oscillator. We will use 
T = 7 and t = 50. Non-stationarity is introduced by a modulation of the period T 

T(n) = T mcan + J 4sm(27r/T mod n) . (4) 

By varying A and r mo d the degree and time scale of the non-stationarity can be 
changed which is reflected in non-constant running variances and autocorrelations. 

3.1 Testing the test size 

The fraction of tests that falsely reject the nullJiypothesis, when it is in fact true, 
is called the size of the test. Since a single tesm says not much about the size, we 
performed 1000 one-sided tests with 9 surrogates for each process. A correct test 
with proper surrogates should yield the nominal size a — 0.1. 

For the first tests we used T mo d = 2000 for the modulation period. As shown in 
Fig. g the new method does indeed yield the correct size of the test for a segment 
size of about T moc i/2 = 1000. Only for relatively strong non-stationarity (^4 = 2.0) 
smaller segments seem to be necessary. If we reduce the time scale of the non- 
stationarity by setting T mo( j = 1000 we get sizes shown in Fig. ^ As expected, 
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Figure 2. Size of nonlinearity tests with varying segment sizes. Now T mo< j = 1000 and A = 2.0 
are used. 

only segment sizes which are smaller than T mo d can decrease the number of false 
rejections. 

4 AR(1)— Processes 

We found cases in which non-stationarity does not yield a higher rate of rejections 
of the null hypothesis but just the opposite. Consider an AR(l)-process 

x n = aix n -i + r\ n (5) 

with a nonlinear measurement function M(-) 

M(x n ) = x n y/\x n \ . (6) 

Non-stationarity is introduced by varying a\ fromJ).5 to 0.9 along the time series. 
We now choose a prediction error as test statistical It is based on locally constant 
predictions in an m-dimensional embedding space. 

Again we performed 1000 one-sided tests with 9 surrogates to estimate the size. 
Additionally we calculated the rank of the original data within the surrogates going 
from 1 if the prediction error of the original data set was lower than for all surrogates 
up to 10 if the original data yields the largest prediction error. For usual stationary 
surrogates we get the distribution presented on the left side of Fig. |[ In 35% of the 
tests the original data has a larger prediction error than all the surrogates and the 
null is rejected in only 2.3% of the tests. This is the opposite of what we expect for 
non- linear signals, which should be better predictable. Here, predictions are easier 
for the stationary surrogates with constant autocorrelations than for the original 
non-stationary data. Prediction error is not a useful discriminating statistic in this 
case. 

On the right side of Fig. || the results for tests with the new surrogates and 
different segment sizes are shown. For a segment size of 1000 we get the correct size 
and therefore a uniform distribution. With decreasing segment sizes the prediction 
error of the surrogates gets larger than for the original data. The predictions seem 
to be very sensitive to the autocorrelations that get lost between the segments. 
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Figure 3. Distributions of the rank of the original data within the surrogates for stationary sur- 
rogates (left) and surrogates with segment sizes 1000 (solid) and 250 (dashed). 



5 IV. Summary 

Dealing with non-stationary signals is still a difficult business. We have given 
numerical evidence that in a test for nonlinearity of a data set with possible non- 
stationarity in form of slow variation it may be feasible to generate surrogates on 
separate segments. These segments can be made smaller than would be necessary 
for individual nonlinearity tests. But the segment size and the test statistic should 
still be chosen with great care, since no general theory guarantees the correctness 
of the test. As a final remark we want to stress that the reasoning in this work is 
not applicable to sudden changes like jumps or spikes. 
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